NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SpecDiff-2: Scaling Diffusion Drafter Alignment For Faster Speculative Decoding

Sandler, Jameson; Christopher, Jacob K; Hartvigsen, Thomas; Fioretto, Ferdinando (April 2026, Conference on Machine Learning and Systems (MLSys))

Speculative decoding has become the standard approach for accelerating Large Language Model (LLM) inference. It exploits a lossless draft-then-verify procedure to circumvent the latency of autoregressive decoding, achieving impressive speed-ups. Yet, current speculative decoding approaches remain limited by two fundamental bottlenecks: (1) the autoregressive dependency during drafting which limits parallelism, and (2) frequent rejections of draft tokens caused by misalignment between the draft and verify models. This paper proposes SpecDiff-2, a novel framework to jointly address these two bottlenecks. It leverages discrete diffusion as a non-autoregressive drafter to address bottleneck (1) and develops novel techniques to calibrate discrete diffusion drafters with autoregressive verifiers, addressing bottleneck (2). Experimental results across a comprehensive benchmark suite show that SpecDiff-2 achieves a new state-of-the-art across reasoning, coding, and mathematical benchmarks, improving tokens-per-second by up to an average of +55% over previous baselines and obtaining up to 5.5x average speed-up over standard decoding, without any loss of accuracy.
more » « less
Full Text Available
Training-Free Constrained Generation With Stable Diffusion Models

Zampini, Stefano; Christopher, Jacob K; Oneto, Luca; Anguita, Davide; Fioretto, Ferdinando (December 2025, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available
Constrained Discrete Diffusion Models

Cardei, Michael; Christopher, Jacob K; Hartvigsen, Thomas; Bartoldson, Brian R; Kailkhura, Bhavya; Fioretto, Ferdinando (December 2025, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available
Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models

Liang, Jinhao; Christopher, Jacob; Koenig, Sven; Fioretto, Ferdinando (July 2025, International Conference on Machine Learning (ICML))

Full Text Available
Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models

Liang, Jinhao; Christopher, Jacob; Koenig, Sven; Fioretto, Ferdinando (July 2025, International Conference of Machine Learning (ICML))

Full Text Available
Neuro-Symbolic Generative Diffusion Models for Physically Grounded, Robust, and Safe Generation

Christopher, Jacob K; Cardei, Michael; Liang, Jinhao; Fioretto, Ferdinando (May 2025, International Conference on Neuro-symbolic Systems (NeuS 2025))

Full Text Available
Neuro-symbolic Generative Diffusion Models for Physically Grounded, Robust, and Safe Generation

Christopher, Jacob K; Cardei, Michael; Liang, Jinhao; Fioretto, Ferdinando (May 2025, International Conference on Neuro-symbolic Systems (NeuS))

Full Text Available
Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models

Liang, Jinhao; Christopher, Jacob; Koenig, Sven; Fioretto, Ferdinando (February 2025, The 6th International Workshop on Multi-Agent Path Finding, at AAAI, 2025)

Full Text Available
Physics-Aware Diffusion Models for Micro-structure Material Design

Christopher, Jacob K; Baek, Stephen; Fioretto, Ferdinando (January 2025, AI For Material Science workshop, at NeurIPS-24)

Full Text Available
Constrained Synthesis with Projected Diffusion Models

Christopher, Jacob K; Baek, Stephen; Fioretto, Ferdinando (January 2025, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available

« Prev Next »

Search for: All records